Creation of Lexical Relations for IndoWordNet
نویسندگان
چکیده
WordNet is an electronic lexical database available on-line as a powerful resource to the researchers in the area of computational linguistics, text processing and other related areas. WordNet for Hindi language has already been developed by IIT, Bombay. The Indian languages WordNets are being created using expansion approach from Hindi WordNet under IndoWordNet project. In expansion approach, semantic relations are borrowed from the reference language, while the lexical relations need to be created for each language, as these relations are language dependent. This paper describes the process of creation of lexical relations like antonym, compounding, conjunction and gradation for IndoWordNet. A lexical creation tool has been presented in this paper with provision to create lexical relations in target language on the basis of relations created in Hindi WordNet and with another provision to create lexical relations in target language without referring to Hindi WordNet. It has been observed that lexical relations for target language can be created easily on the basis of relations created in Hindi WordNet for Hindi in-family languages, while for the languages that do not fall in the same family provision of creation of lexical relation without referring to Hindi WordNet can be used.
منابع مشابه
Tools for IndoWordNet Development
WordNet is an essential resource in Natural language processing. A lexical database like WordNet has a variety of practical applications like machine translation, information retrieval and many more. The creation of a comprehensive WordNet requires many hands and minds working collaboratively. In addition WordNet creation and maintenance demands creation of a wide range of software tools. All t...
متن کاملWord Sense Disambiguation Using IndoWordNet
Word Sense Disambiguation (WSD) is considered as one of the toughest problem in the field of Natural Language Processing. IndoWordNet is a linked structure of WordNets of major Indian languages. Recently, several IndoWordNet based WSD approaches have been proposed and implemented for Indian languages. In this chapter, we present the usage of various other features of IndoWordNet in performing W...
متن کاملIndoWordNet
India is a multilingual country where machine translation and cross lingual search are highly relevant problems. These problems require large resourceslike wordnets and lexiconsof high quality and coverage. Wordnets are lexical structures composed of synsets and semantic relations. Synsets are sets of synonyms. They are linked by semantic relations like hypernymy (is-a), meronymy (part-of), tro...
متن کاملIndoWordnet Visualizer: A Graphical User Interface for Browsing and Exploring Wordnets of Indian Languages
In this paper, we are presenting a graphical user interface to browse and explore the IndoWordnet lexical database for various Indian languages. IndoWordnet visualizer extracts the related concepts for a given word and displays a sub graph containing those concepts. The interface is enhanced with different features in order to provide flexibility to the user. IndoWordnet visualizer is made publ...
متن کاملIndowordnets help in Indian Language Machine Translation
Being less resource languages, Indian-Indian and English-Indian language MT system developments faces the difficulty to translate various lexical phenomena. In this paper, we present our work on a comparative study of 440 phrase-based statistical trained models for 110 language pairs across 11 Indian languages. We have developed 110 baseline Statistical Machine Translation systems. Then we have...
متن کامل